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October 19, 2023 at 09:39 


1. Intro. This is a modification of the program SSXCC-BINARY by Donald Knuth, extending it to take 
into account multiplicities for the items. For the multiplicities, the input format is the same as the one 
defined in the program DLX3. 

This program is another experiment in the use of so-called sparse-set data structures instead of the dancing 
links. It is written as if living on a planet where the sparse-set ideas are well known, but doubly linked links 
are almost unheard-of. 

I suggest that you read SSXCC, SSXCC-BINARY and DLX3 first. 

After this program finds all solutions, it normally prints their total number on stderr, together with 
statistics about how many nodes were in the search tree, and how many “updates” were made. The running 
time in “mems” is also reported, together with the approximate number of bytes needed for data storage. 
(An “update” is the removal of an option from its item list, or the removal of a satisfied color constraint 
from its option. One “mem” essentially means a memory access to a 64-bit word. The reported totals don’t 
include the time or space needed to parse the input or to format the output.) 


##define o mems++ /* count one mem */ 

#-define 00 mems += 2 /* count two mems */ 

7##define 000 mems += 3 /* count three mems */ 

##define subroutine.overhead mems += 4 

#define O "/" /* used for percent signs in format strings «/ 
#define mod % /* used for percent signs denoting remainder in C */ 


##define maz_stage 500 /* at most this many options in a solution */ 

##define maz_level 32000 /* at most this many levels in the search tree */ 

#define maz_cols 100000 /* at most this many items */ 

7##define maz_nodes 10000000 /* at most this many nonzero elements in the matrix */ 
7##define savesize 10000000 /* at most this many entries on savestack */ 

##define bufsize (9 * maz_cols + 3) /* a buffer big enough to hold all item names «/ 


7##define show_basics 1 /* vubose code for basic stats; this is the default */ 

#define show_choices 2 /* vbose code for backtrack logging */ 

##define show_details 4 /* vubose code for further commentary */ 

#define show_record_weights 16 /* vbose code for first time a weight appears */ 
#define show_weight_bumps 32 /* ubose code to show new weights */ 

#define show_final_weights 64 /* vubose code to display weights at the end «/ 
#define show_profile 128 /* ubose code to show the search tree profile «/ 

#define show_fullstate 256 /* vbose code for complete state reports */ 

7##define show_tots 512 /* vubose code for reporting item totals at start */ 

#define show_warnings 1024 /* vubose code for reporting options without primaries */ 
#define show_mazdeg 2048 /* vbose code for reporting maximum branching degree */ 
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2. Here is the overall structure: 


#include <stdio.h> 

#include <stdlib.h> 

#include <string.h> 

#include <ctype.h> 

#include "gb_flip.h" 
typedef unsigned int uint; /* a convenient abbreviation */ 
typedef unsigned long long ullng; /* ditto */ 


(Type definitions 8 ); 
(Global variables 3); 
(Subroutines 6); 


int main(int argc,char *argv[]) 
{ 
register int c, cc, i, 7, k, p, pp, q, T, 8, t, cur-choice, cur_node, best_itm, istage, score, best_s, 
best_l; 


(Process the command line 4); 
(Input the item names 15); 
(Input the options 19); 
if (vbose & show_basics) (Report the successful completion of the input phase 26 ); 
if (vbose & show_tots) {Report the item totals 27); 
imems = mems,mems = 0; 
if (baditem) (Report an uncoverable item 25 ) 
else { 
if (randomizing) (Randomize the item list 28 ); 
(Solve the problem 31); 
} 
done: if (vbose & show_profile) (Print the profile 51); 
if (vbose & show_final_weights) { 
fprintf (stderr, "Final,weights:\n"); 
print_weights ( ); 
} 
if (vbose & show_max_deg) fprintf (stderr, "The,maximum best_itm sizewas,"O"d.\n", maxdeg); 
if (vbose & show_basics) { 
forintf (stderr, "Altogether,"O"1llu,solution"O"s,,,"O"llut+"O"llu_mems,", count, 
count =1?"":"s", imems, mems); 
bytes = (itemlength + setlength) * sizeof (int) + last_node « sizeof 
(node) + 2 * mazl * sizeof (int) + mazsaveptr * (sizeof (twoints) + sizeof (int )); 
forintf (stderr, "\"O"1llu updates, "O"1lu_bytes,."O"1lunodes,", updates, bytes , nodes); 
forintf (stderr, "\Wccosty"O"11d%%.\n", mems ? (200 * cmems + mems)/(2 * mems) : 0); 
} 
if (sanity_checking) fprintf (stderr, "sanity_checking,,was_jon! \n"); 
(Close the files 5); 
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3. You can control the amount of output, as well as certain properties of the algorithm, by specifying 
options on the command line: 


e ‘v(integer )’ enables or disables various kinds of verbose output on stderr, given by binary codes such as 
show-choices ; 

e ‘m( integer )’ causes every mth solution to be output (the default is mO, which merely counts them); 

e ‘s(integer )’ causes the algorithm to randomize the initial list of items (thus providing some variety, 
although the solutions are by no means uniformly random); 

e ‘d(integer )’ sets delta, which causes periodic state reports on stderr after the algorithm has performed 
approximately delta mems since the previous report (default 10000000000); 

e ‘c( positive integer )’ limits the levels on which choices are shown during verbose tracing; 

e ‘C( positive integer )’ limits the levels on which choices are shown in the periodic state reports; 

‘1(nonnegative integer )’ gives a lower limit, relative to the maximum level so far achieved, to the levels 

on which choices are shown during verbose tracing; 

‘t (positive integer )’ causes the program to stop after this many solutions have been found; 

‘T( integer )’ sets timeout (which causes abrupt termination if mems > timeout at the beginning of a level); 

‘w( float )’ is the initial increment dw added to an item’s weight (default 1.0); 

‘W( float )’ is the factor by which dw changes dynamically (default 1.0); 

‘S( filename )’ to output a “shape file” that encodes the search tree. 


Global variables 3) = 

int random_seed = 0; /* seed for the random words of gb_rand */ 

int randomizing; /* has ‘s’ been specified? */ 

int vbose = show_basics + show_warnings; /* level of verbosity */ 

int spacing; /* solution k is output if k is a multiple of spacing */ 

int show_choices.maxz = 1000000; /* above this level, show_choices is ignored */ 

int show_choices.gap = 1000000; /* below level mazl — show-choices.gap, show_details is ignored */ 
int show_levels.max = 1000000; /* above this level, state reports stop */ 

int mac; /* maximum level actually reached */ 

int mazs; /* maximum stage actually reached */ 

int mazsaveptr; /* maximum size of savestack */ 

char buf [bufsize]; /* input buffer «/ 

ullng count; /* solutions found so far «/ 

ullng options; /* options seen so far */ 

ullng imems, mems, tmems, cmems; /* mem counts */ 

ullng updates; /* update counts «/ 

ullng bytes; /* memory used by main data structures */ 

ullng nodes; /* total number of branch nodes initiated «/ 

ullng thresh = 10000000000; /* report when mems exceeds this, if delta £0 */ 
ullng delta = 10000000000; /* report every delta or somems */ 

ullng maxcount = *ffffffff{ffff{fifFf; /* stop after finding this many solutions */ 
ullng timeout = *1fffffffffffffff; /* give up after this many mems */ 

float w0 = 1.0, dw = 1.0, dwfactor = 1.0; /* initial weight, increment, and growth */ 
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float mazwt = 1.0; /* largest weight seen so far */ 

FILE xshape_file ; /* file for optional output of search tree shape */ 
char *shape_name; /* its name */ 

int mazdeg; /* the largest branching degree seen so far */ 


See also sections 9, 30, and 32. 


This code is used in section 2. 
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4. If an option appears more than once on the command line, the first appearance takes precedence. 
(Process the command line 4) = 
for (j = argc —1,k =0; 7; j——) 
switch (argv [j][0]) { 


case ’v’: k |= (sscanf (arguv|j] + 1,""O"d", &ubose) — 1); break; 

case ’m’: k |= (sscanf (argv|[j] +.1,""O"d", &spacing ) — 1); break; 

case ’s’: k |= (sscanf (arguv|j] + 1,""O"d", &random_seed) — 1), randomizing = 1; break; 
case ’d’: k |= (sscanf (arguv[j] +1,""O"11d", &delta) — 1), thresh = delta; break; 
case ’c’: k |= (sscanf (argu[j] + 1,""O"d", &show_choices.maz ) — 1); break; 
case ’C’: k |= (sscanf (argu[j] + 1,""O"d", &show_levels.maz ) — 1); break; 

case ?1’: k |= (sscanf (argu[j] + 1,""O"d", &show_choices_gap ) — 1); break; 
case ’t’: k |= (sscanf (argu[j] + 1,""O"11d", &maxcount) — 1); break; 

case ’T’: k |= (sscanf (argv[j] + 1,""O"11d", &timeout) — 1); break; 

case ’w’: k |= (sscanf (arguv[j] +1,""O"£", &dw) — 1); break; 

case ’W’: k |= (sscanf (arguv[j] + 1,""O"£", &dwfactor) — 1); break; 

case ’S’: shape_name = argv[j] + 1, shape_file = fopen (shape_name, "w"); 


if (ashape_file ) 
fprintf (stderr, "Sorry, ,Ijcan’ tyopen file, ‘"O"s’ for writing! \n", shape.name); 
break; 
default: k = 1; /* unrecognized command-line option */ 
} 
if (k) { 
fprintf (stderr, "Usage: ,"O"s,[v<n>] ,[m<n>] , [s<n>] , [d<n>] ""[c<n>] , [C<n>] , [1<n\ 
>] ,[t<n>] | [T<n>] | [w<f>] |, [W<f>] |, [S<bar>] \<_,f00.d1x\n", argv [0]); 
exit (—1); 
} 
if (randomizing) gb-init_rand (random_seed ); 


This code is used in section 2. 


5. (Close the files 5) = 
if (shape_file) fclose (shape_file ); 


This code is used in section 2. 


6. Here’s a subroutine that I hope is never invoked (except maybe when I’m debugging). 


(Subroutines 6) = 
void confusion(char *m) 


forintf (stderr, ""O"s!\n",m); 


See also sections 11, 12, 138, 14, 36, 42, 48, 49, and 50. 


This code is used in section 2. 


84 


87 SSMCC DATA STRUCTURES i) 


7. Data structures. Sparse-set data structures were introduced by Preston Briggs and Linda Torczon 
[ACM Letters on Programming Languages and Systems 2 (1993), 59-69], who realized that exercise 2.12 
in Aho, Hopcroft, and Ullman’s classic text The Design and Analysis of Computer Algorithms (Addison— 
Wesley, 1974) was much more than just a slick trick to avoid initializing an array. (Indeed, TAOCP exercise 
2.2.6—24 calls it the “sparse array trick.” ) 

The basic idea is amazingly simple, when specialized to the situations that we need to deal with: We can 
represent a subset S of the universe U = {2,%1,...,%n—1} by maintaining two n-element arrays p and gq, 
each of which is a permutation of {0,1,...,n — 1}, together with an integer s in the range 0 <s <n. In 
fact, p is the inverse of g; and s is the number of elements of $. The current value of the set S' is then simply 
{Lpo.+-+,Lp,_,}. (Notice that every s-element subset can be represented in s!(n — s)! ways.) 

It’s easy to test if x, € S, because that’s true if and only if gq, < s. It’s easy to insert a new element x, 
into S: Swap indices so that p, = k, q, = s, then increase s by 1. It’s easy to delete an element x; that 
belongs to S: Decrease s by 1, then swap indices so that p, = k and q, = s. And so on. 

Briggs and Torczon were interested in applications where s begins at zero and tends to remain small. 
In such cases, p and q need not be permutations: The values of ps, ps41, ---,; Pn—1 Can be garbage, and 
the values of gq, need be defined only when x, € S. (Such situations correspond to the treatment by Aho, 
Hopcroft, and Ullman, who started with an array full of garbage and used a sparse-set structure to remember 
the set of nongarbage cells.) Our applications are different: Each set begins equal to its intended universe, 
and gradually shrinks. In such cases, we might as well maintain inverse permutations. The basic operations 
go faster when we know in advance that we aren’t inserting an element that’s already present (nor deleting 
an element that isn’t). 

Many variations are possible. For example, p could be a permutation of {xo,21,...,2%n—1} instead of a 
permutation of {0,1,...,n —1}. The arrays that play the role of g in the following routines don’t have 
indices that are consecutive; they live inside of other structures. 
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8. This program has an array called item, with one entry for each item. The value of item[k] is an 
index x into a much larger array called set. The set of all options that involve the kth item appears in that 
array beginning at set[x]; and it continues for s consecutive entries, where s = size(a) is an abbreviation 
for set|a — 1]. If ttem[k] = x, we maintain the relation pos(x) = k, where pos(x) is an abbreviation for 
set[z — 2]. Thus item plays the role of array p, in a sparse-set data structure for the set of all currently 
active items; and pos plays the role of q. 

A primary item x also has a wt field, set[a — 5], initially 1. The weight is increased by dw whenever 
we backtrack because x cannot be covered. (Weights aren’t actually used in the present program; that will 
come in extensions to be written later. But it will be convenient to have space ready for them in our data 
structures, so that those extensions will be easy to write.) 

And finally, we have the bound and slack fields, set |x —6] and set|[a —7]. These are used in the same way 
as in DLX3 to keep track of the item’s multiplicities. 

Suppose the kth item x currently appears in s options. Those options are indices into nd, which is an 
array of “nodes.” Each node has three fields: itm, loc, and clr. Ifa <q < a+, let y = set|g]. This is 
essentially a pointer to a node, and we have nd[y].ttm = x, nd[y].loc = q. In other words, the sequential 
list of s elements that begins at « = item|k] in the set array is the sparse-set representation of the currently 
active options that contain the kth item. The clr field nd[y].clr contains x’s color for this option. The itm 
and clr fields remain constant, once we’ve initialized everything, but the loc fields will change. 

The given options are stored sequentially in the nd array, with one node per item, separated by “spacer” 
nodes. If y is the spacer node following an option with ¢ items, we have nd[y].itm = —t. If y is the spacer 
node preceding an option with t items, we have nd[y].loc = t. 

This probably sounds confusing, until you can see some code. Meanwhile, let’s take note of the invariant 
relations that hold whenever k, g, x, and y have appropriate values: 


pos(item|k]) =k; nd[set[q]].loc =q; item[pos(x)] =x; set|nd[y].loc] = y. 


(These are the analogs of the invariant relations p[q[k]] = q[p[k]] = & in the simple sparse-set scheme that 
we started with.) 

The set array contains also the item names. 

We count one mem for a simultaneous access to the itm and loc fields of a node. Each node actually has 
a “spare” fourth field, spr, inserted solely to enforce alignment to 16-byte boundaries. (Some modification 
of this program might perhaps have a use for spr?) 


#define size(x) set[(x) —1].2 /* number of active options of the kth item, x */ 

#define pos(x) set|(x) — 2].i /* where that item is found in the item array, k */ 

#define Iname(x) set[(x) — 4].2 /* the first four bytes of ’s name */ 

#define rname(x)  set|(x) — 3].2 /* the last four bytes of x’s name */ 

#define slack(x) set{(x) — 5].i /* if multiplicity [u..v], slack is equal to v-u and does not change */ 
#define bound(x) set|(x) — 6].i /* residual capacity of this item */ 

#define wit(x) set[(x) — 7].f /* the current floating-point “weight” of x */ 


##define primeztra 7 /* this many extra entries of set for each primary item */ 
7##define secondeztra 4 /* and this many for each secondary item */ 
#define mazextra 7 /* maximum of primeztra and secondextra */ 


#tdefine ipropcount 6 /* the number of bytes used for each item in the input phase */ 
(Type definitions 8) = 
typedef struct node-_struct { 


int itm; /* the item x corresponding to this node */ 

int loc; /* where this node resides in x’s active set */ 

int clr; /* color associated with item x in this option, if any */ 

int spr; /* a spare field inserted only to maintain 16-byte alignment */ 
} node; 


typedef union { 
int i; /* an integer (32 bits) «/ 
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float f; /* a floating point value (fits in 4 bytes) */ 
} tetrabyte; 


See also section 10. 


This code is used in section 2. 


9. (Global variables 3) += 


node nd[maz_nodes]; /* the master list of nodes */ 

int last_node; /* the first node in nd that’s not yet used */ 

int item|maz_cols]; /* the master list of items */ 

int second = maz_cols; /* boundary between primary and secondary items */ 
int last_itm; /* items seen so far during input, plus 1 */ 

tetrabyte set|maz_nodes + mazextra * maz-_cols]|; /* active options for active items */ 
int itemlength; /* number of elements used in item */ 

int setlength; /* number of elements used in set */ 

int active; /* current number of active items */ 

int oactive; /* value of active before swapping out current-choice items */ 

int baditem; /* an item with no options, plus 1 */ 

int osecond; /* setting of second just after initial input */ 

int force|maz_cols]; /* stack of items known to have size equal to bound-slack */ 
int forced; /* the number of items on that stack «/ 


10. We're going to store string data (an item’s name) in the midst of the integer array set. So we’ve got 
to do some type coercion using low-level C-ness. 
(Type definitions 8) += 

typedef struct { 


int /, 7; 

} twoints; 

typedef union { 
unsigned char str[8]; /* eight one-byte characters */ 
twoints Ir; /* two four-byte integers */ 


} stringbuf; 
stringbuf namebuf ; 


11. (Subroutines 6) += 
void print_item_name (int k, FILE xstream) 


namebuf .lr.l = Iname(k), namebuf .lr.r = rname(k); 
forintf (stream, "\"O" .88", namebuf .str); 


} 
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12. An option is identified not by name but by the names of the items it contains. Here is a routine that 
prints an option, given a pointer to any of its nodes. It also prints the position of the option in its item list. 
(Subroutines 6) += 

void print_option (int p, FILE *stream,int showpos ) 


{ 


register int k, q, 2; 


x = nd[p].itm; 

if (p > last.node Va <0) { 
forintf (stderr, "Illegaloption,"O"d!\n", p); 
exit (1); 
return; 


for (q=p;; ) { 
print_item_name(a, stream); 
if (nd[q].clr) fprintf (stream, ":"O"c", nd[q].clr); 


x = nd{q].itm; 
if (1 <0) q+=2,2 = nd{q].itm; 
if (¢ =p) break; 


: = nd|qj.loc; 
if (showpos > 0) fprintf (stream, "4("O"dyof,"O"d)\n",k — a +1, size(x)); 
else if (showpos =0) fprintf (stream,"\n"); 

} 

void prow (int p) 


{ 


print_option(p, stderr, 1); 


13. When I’m debugging, I might want to look at one of the current item lists. 
(Subroutines 6) += 
void print_itm (int c) 
{ 
register int p; 
if (c < primextra V c > setlength V pos(c) < 0V pos(c) > itemlength V item|pos(c)| #c) { 
forintf (stderr, "Illegalitem,"O"d!\n",c); 
return; 
i 
forintf (stderr, "Item,("O"d)",c); 
print.item_name(c, stderr); 
if (c < second) { 
if (slack(c) V bound(c) £1) fprintf(stderr, "\("O"d,"O"d)", bound(c) — slack (c), bound (c)); 
forintf (stderr, "WC"O"d of y"O"d) , ylength"O"d, wweight,"O".1£:\n", pos(c) + 1, active, 
size(c), wt(c)); 


else if (pos(c) > active) 

fprintf (stderr, "\,(secondary,"O"d, jpurified) ,length,"O"d:\n", pos(c) + 1, size(c)); 
else fprintf(stderr, "\(secondary,"O"d) , ,length,"O"d:\n", pos(c) + 1, size(c)); 
for (p=c; p< c+ size(c); p++) prow(set[p].i); 
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14. Speaking of debugging, here’s a routine to check if redundant parts of our data structure have gone 
awry. 
7##define sanity_checking 0 /* set this to 1 if you suspect a bug */ 
(Subroutines 6) += 
void print_items() 


for (int i = 0; i < itemlength; i++) print_itm (item [i]); 
} 
void sanity () 
{ 
register int k, x, 7, 1, r, q, @q;3 
int ok = 1; 
for (k = 0; k < itemlength; k++) { 
x = item|k); 
if (pos(x) #k) { 
fprintf (stderr, "Bad.posyfield of item"); 
print.item_name (a, stderr); 
fprintf (stderr, "yC"O"dy!=,"O"d, s"O"d) !\n", k, pos (x), x); 
ok = 0; 
} 
} 
for (i =0; i < last_node; i++) { 
1 = nd{i].ctm,r = nd{i].loc; 
if (1<0) { 
if (nd[it+r+1].itm #¢ —-r) { 
forintf (stderr, "Bad spacerin,nodes,"O"d,y"O"d!\n", 7,4 +r +1); 
ok = 0; 
} 


qq = 0; 
} else { 
if (l>r) fprintf(stderr, "itm>loc in node,"O"d!\n", 2); 
else { 
if (set[r].1 41) { 

fprintf (stderr, "Bad locyfield for option,,"O"dofyitem",r —1+ 1); 
print_item_name (1, stderr); 
fprintf (stderr, "\yinunode,,"O"d! , set [r] .i="O"d\n", i, set[r].i); 
ok =0; 


if (pos(l) < active) { 

if (r <1+ size(l)) q=+1; else q=-—1; /* in or out? */ 

if (q* qq <0) { 
forintf (stderr, "Flipped statusatoption,,"O"d of item",r —1+ 1); 
print_item_name (I, stderr); 
forintf (stderr, "yin node,"O"d! ,4q,qq="O"d,"O"d\n", 2, g, 9q); 
ok = 0; 

} 

qd = 4 


‘ 
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15. Inputting the matrix. Brute force is the rule in this part of the code, whose goal is to parse and 
store the input data and to check its validity. 
We use only ipropcount entries of set per item while reading the item-name line. 
#define panic(m) 
{ fprintf(stderr, ""O"s!\n"O"d:4"O".99s\n",m, p, buf); exit (—666); } 
(Input the item names 15) = 
while (1) { 
if (fgets (buf , bufsize, stdin)) break; 
if (0, buf [p = strlen(buf) — 1] A ’?\n’) panic("Input,line,way,jtoo,long"); 
for (p = 0; 0, isspace(buf [p]); p++) ; 
if (buf [p] =’ |? V buf [p]) continue; /* bypass comment or blank line «/ 
lastitm = 1; 
break; 
} 
if (slast_itm) panic("No,items"); 
for (; 0, buf[p]; ) { 
o, namebuf .lr.l = namebuf .lr.r = 0; 
(Scan an item name, possibly prefixed by bounds 16); 
00, name (last_itm * ipropcount ) = namebuf .lr.l, rname (last_itm * ipropcount) = namebuf .Ir.r; 
0, slack (last_itm * tpropcount) = q — 1, bound (last_itm * ipropcount) = q; 
last_itm ++; 
if (last_itm > maz_cols) panic("Too,many,,items"); 
for (p += j +1; 0, isspace (buf [p]); p++) ; 
if (buf [pl =? 1?) { 
if (second 4 maz_cols) panic("Itemname,line,contains,,| twice"); 
second = last_itm; 
for (p++; o, isspace (buf [p]); p++) ; 
} 
} 


This code is used in section 2. 
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16. (Scan an item name, possibly prefixed by bounds 16) = 
if (second = masz_cols) istage =0; else istage = 2; 
start_name: for (j = 0; 7 < 8A (0, nisspace (buf [p + 3])); H+) { 
if (buf [p+ j)=?2?) { 
if (istage) panic("Illegal,,‘:’yin,item mame"); 
(Convert the prefix to an integer, q 17); 
r=q,tstage =1; 
goto start_name; 
} else if (buf[p+ 7] =’1’) { 
if (istage > 1) panic("Illegal,‘ |’ ,in,item name"); 
(Convert the prefix to an integer, g 17); 
if (q=0) panic("Upper_bound,is,zero" ); 
if (istage =0) r=q; 
else if (r > q) panic("Lower bound jexceeds,upper,bound"); 
istage = 2; 
goto start_name; 


} 


0, namebuf .str{j] = buf [p + J]; 


switch (istage) { 

case 1: panic("Lower,bound_without_upper,bound"); 

case 0: g=r=1; 

case 2: break; 

} 

if (j =0) panic("Itemnamejempty"); 

if (7 = 8A 7isspace (buf [p+ j])) panic("Item name, ,too,long"); 
(Check for duplicate item name 18); 


This code is used in section 15. 


17. (Convert the prefix to an integer, g 17) = 
for (q=0, pp =p; pp <p+Jj; pp++) { 
if (buf [pp] < °0’ V buf [pp] > °9’) panic("Illegal,digit,,in, bound spec"); 
q=10*q+ buf [pp] — 70’; 


p= pp +1; 
while (j) namebuf .str[——j] = 0; 


This code is used in section 16. 


18. (Check for duplicate item name 18) = 
for (k = last_itm — 1; k; k--) { 
if (0, name(k * ipropcount) 4 namebuf .Ir.l) continue; 
if (rname(k * ipropcount) = namebuf .Ir.r) break; 
if (k) panic("Duplicate,item mame" ); 


This code is used in section 16. 
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19. I’m putting the option number into the spr field of the spacer that follows it, as a possible debugging 
aid. But the program doesn’t currently use that information. 


(Input the options 19) = 
while (1) { 
if (fgets (buf, bufsize, stdin)) break; 
if (0, buf [p = strien(buf) — 1] A ?\n’) panic("Optionline,too,,long" ); 
for (p = 0; 0, isspace(buf [p]); p++) ; 
if (buf [p] =’ |? V 7buf [p]) continue; /* bypass comment or blank line «/ 
i = last_node; /* remember the spacer at the left of this option */ 
for (pp = 0; buf [pl]; ) { 
o, namebuf .lr.l = namebuf .lr.r = 0; 
for (j =0; 7 < 8A (0, 7isspace (buf [p+ j])) A buf [p+ 7] 4 ?:?3 J++) 0, namebuf .str[j] = buf [p+ 3]; 
if (=j) panic("Empty,item_name"); 
if (7 =8 A 7isspace (buf [p+ j]) A buf [p+ 7] 4 ?:’) panic("Item mame, too,,long"); 
(Create a node for the item named in buf [p] 20); 
if (buf [p+ 7] #’:’) 0, nd[last_node].clr = 0; 
else if (k > second) { 
if ((o, isspace (buf [p + 7 + 1])) V (0, misspace (buf [p + 7 + 2]))) 
panic("Color, must ,be,a,single,character"); 
o, nd |last_node|.clr = (unsigned char) buf [p+ + 1]; 
p += 2; 
} else panic("Primary,item must jbe,uncolored"); 
for (p += j +1; 0, isspace(buf [p]); p++) ; 


if (—pp) { 
if (vbose & show_warnings) fprintf (stderr, "Option,ignored,,(no_primary,items) :,,"O"s", buf); 
while (last_node >i) { 
(Remove last_node from its item list 21); 
last_node —; 


} 
} else { 


o, nd|i].loc = last_node — i; /* complete the previous spacer */ 
last_node ++; /* create the next spacer */ 

if (last_node = maa_nodes) panic("Too,many,modes"); 

options ++; 

o, nd [last_node].itm =1+ 1 — last_node; 

nd[last_node].spr = options; /* option number, for debugging only «/ 


} 


} 
(Initialize item 22); 
(Expand set 23); 
(Adjust nd 24); 


This code is used in section 2. 
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20. We temporarily use pos to recognize duplicate items in an option. 


(Create a node for the item named in buf [p] 20) = 
for (k = (last_itm — 1) * ipropcount; k > 0; k —= ipropcount) { 
if (0, Iname(k) A namebuf .lr.l) continue; 
if (rname(k) = namebuf .ir.r) break; 
} 
if (4k) panic("Unknown,item, name"); 
if (0, pos(k) > 7%) panic("Duplicate,item,mamein,this,option"); 


last_node-++; 
if (last_node = maa_nodes) panic("Too many modes"); 
0, t = size(k); /* how many previous options have used this item? */ 


o, nd [last_node].itm = k/ipropcount , nd[last_node].loc = t; 
f ((k/ipropcount) < second) pp = 1; 
0, size(k) =t + 1, pos(k) = last_node; 


This code is used in section 19. 


21. (Remove last_node from its item list 21) = 
0, k = nd[last_node].itm * ipropcount; 
00, size(k)——, pos(k) =i-—1; 


This code is used in section 19. 


22. (Initialize item 22) = 
active = itemlength = last_itm — 1; 
for (k = 0,7 = primeztra; k < itemlength; k++) 
oo, item|k] = j,j += (k+2 < second ? primeztra : secondextra) + size((k + 1) * tpropcount ); 
setlength = j — ipropcount; /* a decent upper bound x/ 
if (second = maz-cols) osecond = active, second = j; 
else osecond = second — 1; 


This code is used in section 19. 


§20 


23. Going from high to low, we now move the item names and sizes to their final positions (leaving room 


for the pointers into nb). 


(Expand set 23) = 
for (; k; k—) { 
0,j = item|k — 1]; 
if (k = second) second = J; /* second is now an index into set */ 
00, size(j) = size(k * tpropcount); 
iP Geen }) =0Ak < osecond) baditem = k; 

0, see j)=k-1,; 
00, rname(j) = poets * ipropcount ), Iname(j) = Iname(k * ipropcount ); 
oo, slack(j) = slack (k * ipropcount ), bound(j) = bound (k * ipropcount ); 
if (k < osecond) 0, wt(j) = w0; 


} 


This code is used in section 19. 
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24. (Adjust nd 24) = 
for (k =1; k < last_node; k++) { 


if (0, nd[k].itm <0) continue; /* skip over a spacer */ 
0,j = item|nd|[k].itm — 1]; 
i=j+4+nd{k].loc; /* no mem charged because we just read nd[k].itm */ 


o, nd|k].ttm = j, nd[k].loc = 4; 
0, set |i].i = k; 
} 


This code is used in section 19. 


25. (Report an uncoverable item 25) = 
{ 
if (vbose & show_choices) { 
forintf (stderr, "Item"); 
print_item_name (item|baditem — 1], stderr); 
forintf (stderr, "\jhasyno,options! \n"); 


} 


This code is used in section 2. 


26. The “number of entries” includes spacers (because DLX2 includes spacers in its reports). If you want 
to know the sum of the option lengths, just subtract the number of options. 


(Report the successful completion of the input phase 26) = 
forintf (stderr, "("O"11ld options, "O"d+"O"d items, .,"O"d entries,successfully,read) \n", 
options , osecond, itemlength — osecond, last_node); 


This code is used in section 2. 


27. The item lengths after input are shown (on request). But there’s little use trying to show them after 
the process is done, since they are restored somewhat blindly. (Failures of the linked-list implementation in 
DLX2 could sometimes be detected by showing the final lengths; but that reasoning no longer applies.) 


(Report the item totals 27) = 
{ 

fprintf (stderr, "Item,totals:"); 

for (k =0; k < itemlength; k++) { 
if (k = second) fprintf(stderr ,"|"); 
fprintf (stderr, "\\"O"d", size (item [k])); 

} 

fprintf (stderr, "\n"); 


} 


This code is used in section 2. 


28. (Randomize the item list 28) = 
for (k = active; k>1; ) { 
mems += 4,7 = gb_unif_rand(k); 
k--} 
00, 00,t = item|[j], item[j] = item|[k], item|k] = t; 
00, pos(t) = k, pos (item|[j]) = j; 
} 


This code is used in section 2. 
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29. Binary branching versus d-way branching. Nodes of the search tree in the previous program 
SSXCC, on which this one is based, are characterized by the name of a primary item 7 that hasn’t yet been 
covered. If that item currently appears in d options {01,...,0a}, node 7 has d children, one for each choice 
of the option that will cover 7. 

The present program, however, makes 2-way branches, and its nodes are labeled with both an item 7 and 
an option o. The left child of node (%,0) represents the subproblem in which 7 is covered by o, as before. 
But the right child represents the subproblem for which option o is removed but item 2 is still uncovered 
(unless d = 1, in which case there’s no right child). Thus our search tree is now rather like the binary tree 
that represents a general tree. (See The Art of Computer Programming, Section 2.3.2.) 

There usually is no good reason to do binary branching when we choose 7 so as to minimize d. On the 
right branch, 7 will have d — 1 remaining options; and no item 7’ will have fewer than d — 1. 

But this program is intended to provide the basis for other programs, which extend the branching heuristic 
by taking dynamic characteristics of the solution process into account. While exploring the left branch in 
such extensions, we might discover that a certain item 7’ is difficult to cover; hence we might prefer to branch 
on an option o’ that covers 7’, after rejecting o for item 2. 


30. We shall say that we’re in stage s when we’ve taken s left branches. We’ll also say, as usual, that we’re 
at level 1 when we’ve taken / branches altogether. 

Suppose, for instance, that we’re at level 5, having rejected 0; for 21, accepted og for i2, accepted o3 for is, 
rejected o4 for i4, and rejected os for is. Then we will have stage = 2, and choice{k] = o, for 0 < k < 5; 
here each o; is a node whose itm field is 7;. Also 


stagelevel 


stagelevel 
levelstage [0] = 
stagelevel 


I 


1, 
levelstage [1] = 2, 
levelstage [2] = 5. 
stagelevel 


0 

1 

2 
stagelevel [3] = 2 

4 

5 


stagelevel 


The option choice[k] has been accepted if and only if levelstage|stagelevel [k]] = k. 


(Global variables 3) += 
int stage; /* number of choices in current partial solution «/ 
int level; /* current depth in the search tree (which is binary) */ 
int choice |maz_level]; /* the option and item chosen on each level */ 
int deg|maz_level]; /* the number of options the item had at that time */ 
int levelstage|maz_stage]; /* the most recent level at each stage */ 
int stagelevel |maz_level]; /* the stage that corresponds to each level «/ 
ullng profile|maz_stage]; /* number of search tree nodes on each stage */ 
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31. The dancing. Our strategy for generating all exact covers will be to repeatedly choose an active 
primary item and to branch on the ways to reduce the possibilities for covering that item. And we explore 
all possibilities via depth-first search. 

The neat part of this algorithm is the way the sets are maintained. Depth-first search means last-in-first- 
out maintenance of data structures; and the sparse-set representations make it particularly easy to undo 
what we’ve done at deeper levels. 

The basic operation is “including an option.” That means (i) removing from the current subproblem all 
of the other options with which it conflicts, and (ii) considering all of its primary items to have their bounds 
decreased by 1. If this would make the bound of an item 0, we can make that item inactive. 


(Solve the problem 31) = 
{ 
level = stage = 0; 
forward: nodes++; 
if (vbose & show_profile) profile|stage|++; 
if (sanity_checking) sanity(); 
(Maybe do a forced move 41); 
(Save the currently active items and their sizes and bounds 45); 
(Do special things if enough mems have accumulated 33); 
(Set best_itm to the best item for branching, and let score be its branching degree 43); 
if (forced) { 
0, best_itm = force |—— forced]; 
(Do a forced move 47); 
} 
if (score <0) goto backup; 
if (score = inf_size) (Visit a solution and goto backup 44); 
advance: oo, choice [level] = cur_choice = set|best_itm].i; 
o, deg [level] = score; 
if (sinclude_option(cur-choice)) goto tryagain; 
(Increase stage 34); (Increase level 35); 
goto forward; 
tryagain: if (score =1) goto backup; 
if (vbose & show_choices) fprintf (stderr, "Backtracking,,in,stage,"O"d\n", stage); 
goto purgeit; 
backup: o, saveptr = saved| stage]; 
if (——stage <0) goto done; 
if (vbose & show_choices) fprintf (stderr, "Backtracking,tostage,"O"d\n", stage); 
o, level = levelstage [stage]; 
purgeit: if (0, deg[level] =1) goto backup; 
(Restore the currently active items and their sizes and bounds 46); 
0, cur_choice = choice [level]; 
(Remove the option cur-choice 39); 
(Increase level 35); 
goto forward; 


} 


This code is used in section 2. 
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32. We save the sizes of active items on savestack, whose entries have two fields / and r, for an item and 
its size. This stack makes it easy to undo all deletions, by simply restoring the former sizes. We also keep a 
stack boundsstack, to save and later restore the bounds. 


(Global variables 3) += 


int level; /* number of choices in current partial solution */ 
int choice [maz_level]; /* the node chosen on each level */ 
int saved|maz_level + 1]; /* size of savestack on each level */ 


twoints savestack |savesize]; 

int boundsstack |savesize]; 

int saveptr; /* current size of savestack and boundsstack */ 

int tough_itm; /* item whose set of options has just become empty */ 


33. (Do special things if enough mems have accumulated 33) = 
if (delta \ (mems > thresh)) { 
thresh += delta; 
if (vbose & show_full_state) print_state( ); 
else print_progress(); 
} 
if (mems > timeout) { 
fprintf (stderr, "TIMEOUT! \n"); goto done; 


This code is used in section 31. 


34. (Increase stage 34) = 
if (++stage > mags) { 
if (stage > maz_stage) { 
fprintf (stderr, "Too many,stages !\n"); 
exit (—40); 
} 
mars = stage; 


} 


This code is used in section 31. 


35. (Increase level 35) = 
if (++level > maal) { 
if (level > maz_level) { 
forintf (stderr, "Too,many,,levels!\n"); 
exit (—4); 


maal = level; 


oo, stagelevel [level] = stage, levelstage|stage] = level; 


This code is used in section 31. 


§36 


36. 
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The include_option routine extends the current partial solution, by hiding option opt. In addition, 


it will cover any primary items in opt if their bound after hiding opt becomes 0. The routine returns 0, 
however, if that would make some other primary item uncoverable. (In the latter case, tough_itm is set to 
the item that was problematic.) 
(Subroutines 6) += 

int include_option(int opt) 


{ 


register int c, optp, nn, nnp, ss, i, wi, p, pp, s; 
subroutine_overhead ; 
if (vbose & show_choices) { 

forintf (stderr, "S"O"d:", stage); 

print_option (opt, stderr, 1); 


} 


for (; 0, nd[opt — 1].ttm > 0; opt—) ; /* move to the beginning of the option */ 
for (; 0, (4 = nd[opt].itm) > 0; opt++) { 

pp = ndlopt].loc; /* where opt appears in ii’s set */ 

0, p = pos (it); /* where ii appears in item */ 


if (ii > second \ p > active) continue; 
(Cover or commit item ii, decrease bound of item ii if primary, potentially deactivate it, or return 
0 37); 
} 


return 1; 
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37. We need to remove the options that conflict with opt from the sets of their items. 


(Cover or commit item ii, decrease bound of item 7 if primary, potentially deactivate it, or return 0 37) = 
d 
if (ii < second A p < active) o, bound (ii)—; /* one mem charged on next line */ 
if (ii > second V (0, bound(i) =0)) { 
0, 88 = size(it); 
if (ii < second) c=0; 
else o,c = nd[opt].clr; 
for (s = ti +ss —1; s > ti; s—) 
if (s # pp) { 
0, optp = set|s].i; 
if (c=0V (0, nd[optp].clr #c)) (Remove optp from its other sets, or return 0 38); 


0, p = pos(it); 
0, Wt = item|—— active]; 
00, item|active] = it, item|[p] = iii; 
00, pos (ii) = active, pos (tit) = p; 
} 
else { 
0, 88 = size(tt) — 1; 
if (00, ss < bound(t) — slack(i)) { 
if ((vbose & show_details) A level < show_choices_maz / level > maal — show_choices_gap) { 
forintf (stderr, "\jcan’ tycover"); 
print_item_name (item [ii], stderr); 
forintf (stderr, "\n"); 


tough_itm = tt; 
forced = 0; 
return 0; /* abort the deletion, lest ii be wiped out */ 


if (ss =0) { /* Just deactivate the item, no hiding needed */ 
0, tit = item|—— active]; 
00, item|[active] = i, item|p] = iii; 
00, pos (ti) = active, pos (iti) = p; 

} 

else { 
00, nnp = set[ii + ss].i, size(ii) = ss; 
oo, set[ii + ss].i = opt, set[pp].2 = nnp; 
oo, ndlopt].loc = ti + ss,nd|[nnp|.loc = pp; 
updates ++; 

} 

} 
} 


This code is used in section 36. 
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38. At this point optp points to a node of an option that we want to remove from the current subproblem. 
We swap it out of the sets of all its items, except for the sets of inactive secondary items. (These have been 
purified, and we shouldn’t mess with their sets.) 


(Remove optp from its other sets, or return 0 38) = 


{ 
register int nn, i, iii, p, pp, ss, nnp; 
for (nn = optp; o,nd[nn — 1].itm > 0; nn——-) ; /* move to beginning of the option */ 
for (; 0, (% = nd[nn].ttm) > 0; nn++) { 
p = nd[nn].loc; 
if (p > second A (0, pos(iit)) > active) continue; /* ti already purified */ 
0, $8 = size(t#i) — 1; 
if (p < second) { 
if (00, ss < bound(ii) — slack(wi)) { 
if ((vbose & show_details) A level < show-choices.max / level > maal — show-choices_gap) { 
forintf (stderr, "Can’ tycover"); 
print.item_name (ii, stderr); 
fprintf (stderr ,", ysize="O"d, jbound="O"d, yslack="O"d, ju="O"d\n", ss, bound (ii), 
slack (ii), bound (ii) — slack (it)); 
} 


tough_itm = wi; 
forced = 0; 
return 0; /* abort the deletion, lest ii be wiped out */ 


} 
if (ss =0) { 
0, Wt = item|—— active]; 
0, pp = pos (it); 
if (vbose & show_details) { 
fprintf (stderr, "Empty,option,list ,,deactivating"); 
print_item_name (it, stderr ); 
fprintf (stderr, "\n"); 
i 
00, item|active] = ti, item[pp| = iii; 
00, pos (tt) = active, pos (ti) = pp; 


} 


i 

if (ss > 0) { 
0, nnp = set [ti + ss].i; 
0, size(wt) = 8s; 
oo, set [ti + ss]. = nn, set[p].c = nnp; 
oo, nd[nn].loc = it + ss,nd[nnp].loc = p; 
updates ++; 

i 

} 
} 


This code is used in section 37. 
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39. (Remove the option cur_choice 39) = 


{ 


register int 7, ii, ss, p, nnp; 


for ( ; 0, nd[cur_choice — 1].itm > 0; cur_choice——) ; /* move to beginning */ 
for ( ; 0, (4 = nd[cur_choice].itm) > 0; cur_choice++) { 
p = nd[cur_choice].loc; 
if (p > second A (0, pos(it)) > active) continue; /* tw inactive «/ 
0, 88 = size(ti) — 1; 
if (p < second) { 
if (00, 8s < bound(ii) — slack(wi)) { 
if ((vbose & show_details) A level < show choices.max / level > maal — show-choices.gap) { 
forintf (stderr, "\jcan’ tycover"): 
print_item_name (item |ii], stderr); 
fprintf (stderr, "\n"); 
} 


goto backup; 


‘i 
if (ss =0) { 
0, Wt = item|—— active]; 
0, pp = pos (it); 
if (vbose & show_details) { 
fprintf (stderr, "Null_move, deactivating"); 
print_item_name (it, stderr ); 
fprintf (stderr, "\n"); 
} 
00, item|active] = ti, item[pp| = tii; 
00, pos (tt) = active, pos (ti) = pp; 
} 
i 
if (ss >0) { 
00, nnp = set[ii + ss].i, size(a) = ss; 
oo, set [ti + ss].i = cur_choice, set[p].i = nnp; 
oo, nd[cur_choice].loc = tt + ss,nd[nnp].loc = p; 
updates ++; 
} 
} 
} 


This code is used in section 31. 
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40. Ifa weight becomes dangerously large, we rescale all the weights. 

(That will happen only when dwfactor isn’t 1.0. Adding a constant eventually “converges”: For example, 
if the constant is 1, we have convergence to 2!” after 2!’ — 1 = 16777215 steps. If the constant dw is .250001, 
convergence to 8.38861e+06 occurs after 25165819 steps!) 

(Note: I threw in the parameters dw and dwfactor only to do experiments. My preliminary experiments 
didn’t turn up any noteworthy results. But I didn’t have time to do a careful study; hence there might be 
some settings that work unexpectedly well. The code for rescaling might be flaky, since it hasn’t been tested 
very thoroughly at all.) 

#define dangerous 1-10°?p 
#define wmin 1-107°°p 


(Increase the weight of toughitm 40) = 
cmems += 2, 00, wt(tough_itm) += dw; 
if (vbose & show_record_weights \ wt(tough_itm) > maxwt) { 
maxwt = wt (tough_itm); 
fprintf (stderr, ""O"8.1£", mazrwt); 
print.item_name(tough_itm, stderr); 
fprintf (stderr, "\"O"11ld\n", nodes); 
} 
if (vbose & show_weight_bumps) { 
print_item_name(tough_itm, stderr); 
fprintf (stderr, "\Wwty"O".1£\n", wt (toughitm)); 
} 
dw *= dwfactor ; 
if (wt(tough_itm) > dangerous) { 
register int k; 
register float t; 
tmems = mems; 
for (k =0; k < itemlength; k++) 
if (0, item[k] < second) { 
o,t = wt(item[k]) «1 -10~7°p; 
0, wt (item|k]) = (t < wmin ? wmin : t); 


dw x= 1-107; 

if (dw < wmin) dw = wmin; 
w0 *=1-1072%p; 

if (w0 < wmin) w0 = wmin; 
cmems += mems — tmems; 


} 


41. (Maybe do a forced move 41) = 
while (forced) { 
0, best_itm = force|—— forced]; 
if (0, pos(best_itm) < active) { 
o, saved [stage] = saveptr; /* nothing placed on savestack */ 
(Do a forced move 47); 
} 
} 


This code is used in section 31. 
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42. (Subroutines 6) += 
void print_weights (void) 
{ 
register int k; 
for (k = 0; k < itemlength; k++) 
if (item|k] < second A wt(item[k]) 4 w0) { 
print_item_name (item |k], stderr ); 
fprintf (stderr ,"wty"O" .1£\n", wt (item|[k])); 
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43. The “best item” is considered to be an item that minimizes the branching degree. If there are several 
candidates, we choose the leftmost — unless we’re randomizing, in which case we select one of them at 
random. 

Consider an item that has four options {w, x,y,z}, and suppose its bound is 3. If the slack is zero, we’ve 
got to choose either w or xz, so the branching degree is 2. But if slack = 1, we have three choices, w or x or 
y; if slack = 2, there are four choices; and if slack > 3, there are five, including the “null” choice. 

In general, the branching degree turns out to be 1+ s — 6+ 1, where / is the length of the item, b is the 
current bound, and s is the minimum of b and the slack. This formula gives degree < 0 if and only if / is 
too small to satisfy the item constraint; in such cases we will backtrack immediately. (It would have been 
possible to detect this condition early, before updating all the data structures and increasing level. But that 
would make the downdating process much more difficult and error-prone. Therefore I wait to discover such 
anomalies until item-choosing time.) 

Let’s assign the score 1+ s—b+1 to each item. If two items have the same score, I prefer the one with 
smaller s, because slack items are less constrained. If two items with the same s have the same score, I 
(counterintuitively) prefer the one with larger b (hence larger !), because that tends to reduce the size of the 
final search tree. 

Consider, for instance, the following example taken from MDANCE: If we want to choose 2 options 
from 4 in one item, and 3 options from 5 in another, where all slacks are zero, and if the items are 
otherwise independent, it turns out that the number of nodes per level if we choose the smaller item first is 
(1,3,6,6-3,6-6,6-10). But if we choose the larger item first it is (1,3,6,10,10-3,10-6), which is smaller 
in the middle levels. 

Notice that a secondary item is active if and only if it has not been purified (that is, if and only if it hasn’t 
yet appeared in a chosen option). 


#define inf_size *7f£ffffff 
(Set best_itm to the best item for branching, and let score be its branching degree 43) = 


{ 
score = inf_size, tmems = mems; 
if ((vbose & show_details) A level < show_choices_max / level > maal — show_choices_gap ) 
forintf (stderr, "Level"O"d:\n", level); 
for (k =0; k < active; k++) 
if (0, item[k] < second) { 
0, 8 = slack (item|[k}); 
if (0,s > bound (item|k])) s = bound (item |[k]); 
0,t = size(item[k]) + s — bound (item[k]) + 1; 
if ((vbose & show_details) A level < show_choices_max / level > maal — show_choices_gap) { 
print_item_name (item |k], stderr); 
if (bound (item[k]) A1Vs 40) { 
fprintf (stderr, "("O"d:"O"d,"O"d)", bound (item [k]) — s, bound (item |k]), t); 


else fprintf(stderr ,"("O"d)",t); 


if (¢ =1A (bound (item[k]) — slack (item|k]) > 0)) 
for (int i = bound (item[k]) — slack (item|k]); i > 0; 1—) 0, force [forced ++] = item |k]; 
if (t < score) { 
if (t < score Vs < best_s V (s = best_s A size(item|k]) > best_l)) 
score = t, best_itm = item|k], best_s = s, best_l = size(item[k]),p = 1; 
else if (s = best_s A size(item[k]) = best_l) { 
pH; /*x This many items achieve the min */ 
if (randomizing \ (mems += 4, 7gb_unif_rand(p))) best_itm = item|k]; 
} 
} 
} 
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if ((vbose & show_details) A level < show_choices_maz / level > maal — show_choices_gap) { 


if (score = inf_size) fprintf(stderr, "\ysolution\n"); 
else if (forced) { 
fprintf (stderr, "\yfound_,"O"dforced\n", forced); 
for (int i =0; i < forced; i++) { 
print_item_name (force [i], stderr); 
forintf (stderr, "\n"); 


} 


else { 
fprintf (stderr, "joranching,,on"); 
print_item_name (best_itm, stderr ); 
fprintf (stderr ,"("O"d)\n", score); 


} 
if (score > maxdeg A score < inf_size \ aforced) maxdeg = score; 
if (shape_file) { 
if (score = inf_size) fprintf (shape_file, "sol\n"); 
else { 
fprintf (shape_file,"""O"'d", score); 
print_item_name (best_itm, shape-_file ); 
fprintf (shape_file,"\n"); 


fflush (shape_file); 
i 


cmems += mems — tmems; 


} 


This code is used in section 31. 


44, (Visit a solution and goto backup 44) = 
{ 
count H+; 
if (spacing A (count mod spacing =0)) { 
printf (""O"11d:\n", count); 
for (k =0; k < stage; k++) print_option (choice [levelstage [k]], stdout , 0); 
fflush (stdout); 


if (count > maxcount) goto done; 
goto backup; 


} 


This code is used in section 31. 


§43 
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45. (Save the currently active items and their sizes and bounds 45) = 
o, saved |stage] = saveptr; 
if (saveptr + active > maxsaveptr) { 
if (saveptr + active > savesize) { 
forintf (stderr, "Stack, overflow, (savesize="O"d) !\n", savesize ); 


exit (—5); 
} 
maxsaveptr = saveptr + active; 
} 
for (p = 0; p< active; pt) { 


000, savestack |saveptr + p].l = item [p|, savestack|[saveptr + p].r = size (item|[p}); 
if (item|[p] < second) oo, boundsstack|saveptr + p| = bound (item [p]); 


saveptr += active; 


This code is used in section 31. 


46. (Restore the currently active items and their sizes and bounds 46) = 

o, active = saveptr — saved |stage]; 
saveptr = saved|stage]; 
for (p = 0; p< active; pt) { 

00, size(savestack |saveptr + p|.l) = savestack|saveptr + p].r; 

if (0, savestack|saveptr + p].l < second) 

00, bound (savestack |saveptr + p|.l) = boundsstack[saveptr + p]; 

‘ 


This code is used in section 31. 


47. A forced move occurs when best_itm has bound-slack remaining options. In this case we can streamline 
the computation, because there’s no need to save the current active sizes and bounds. (They won’t be looked 
at.) 


(Do a forced move 47) = 


{ 


if ((ubose & show_choices ) A level < show_choices.maz) fprintf(stderr," (forcing) \n"); 
score = 1; 
goto advance; 


} 


This code is used in sections 31 and 41. 


48. (Subroutines 6) += 
void print_savestack (int start, int stop) 


{ 
int k; 
for (k = start; k < stop; k++) { 
print_item_name (savestack [k].1, stderr); 
forintf (stderr, "("O"d) ,y"O"ay"O" d\n", savestack |k].1, savestack [k].r, boundsstack [k]); 
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49. (Subroutines 6) += 
void print_state (void ) 
{ 
register int [, s; 
forintf (stderr, "Currentystate,(level,"O"d) :\n", level); 
for (J =0; | < level; I++) { 
if (levelstage|stagelevel [l]] A 1) fprintf(stderr, "~"); 
print_option (choice [I], stderr, —1); 
fprintf (stderr, "\(of4"O"d) \n", deg [I]); 
if (1 > show_levels.max) { 
forintf (stderr, "4W...\n"); 
break; 
} 
forintf (stderr, "\y"O"1ldjsolutions,."O"1ld.mems, ,,and,max,jlevel,"O"dsojfar.\n", count, 


mems, mazl); 


} 
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50. During a long run, it’s helpful to have some way to measure progress. The following routine prints a 
string that indicates roughly where we are in the search tree. The string consists of node degrees, preceded 
by ‘~’ if the node wasn’t the current node in its stage (that is, if the node represents an option that has 


already been fully explored — “we’ve been there done that”). 


Following that string, a fractional estimate of total progress is computed, based on the naive assumption 
that the search tree has a uniform branching structure. If the tree consists of a single node, this estimate is .5. 
Otherwise, if the first choice is the kth choice in stage 0 and has degree d, the estimate is (k —1)/(d+k-1) 
plus 1/(d+k-—1) times the recursively evaluated estimate for the kth subtree. (This estimate might obviously 


be very misleading, in some cases, but at least it tends to grow monotonically.) 


(Subroutines 6) += 
void print_progress (void) 
{ 
register int |, ll, k, d, c, p, ds =0; 
register double f, fd; 


fprintf (stderr, "after,"O"1lld mems:."O"1ldsols,", mems, count); 


for (f = 0.0, fd = 1.0,1 =0; I < level; I++) { 
if (1 < show_levels_maz ) 


fprintf (stderr, "\"O"s"O"d", levelstage[stagelevel[l]] =1?"": 


if (levelstage|stagelevel [l]] =1) { 


for (k = 1,d= deg[l], lJ =1—1; Il > 0A stagelevel [ll] = stagelevel|l]; k++,d++,ll—) ; 


fd x=d, f += (k —1)/fd; /* choice | is treated like k of d «/ 


if (1 > show_levels.maz A 7ds) ds = 1, fprintf (stderr,"..."); 


} 
fprintf (stderr, "\"O" .5£\n", f +0.5/fd); 


51. (Print the profile 51) = 


{ 


fprintf (stderr, "Profile:\n"); 


} 


This code is used in section 2. 


for (k =0; k < mazs; k++) fprintf(stderr ,""O"3d:"O"1ld\n", k, profile [k]); 
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52. Index. 

active: 9, 13, 14, 22, 28, 36, 37, 38, 39, 41, 
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d: 50. 


dangerous: AO. 
deg: 30, 31, 49, 50. 


delta: 3, 4, 33. 

done: 2, 31, 33, 44. 

ds: 50. 

dw: 3, 4, 8, 40. 

dwfactor: 3, 4, 40. 

exit: 4, 12, 15, 34, 35, 45. 
f: 8, 50 
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fd: 50. 

fflush: 438, 44. 

fgets: 15, 19. 

fopen: 4. 

force: 9, 31, 41, 43. 

forced: 9, 31, 37, 38, 41, 43. 
forward: 31. 

forintf: 2, 4, 6, 11, 12, 18, 14, 15, 19, 25, 26, 


27, 31, 33, 34, 35, 36, 37, 38, 39, 40, 42, 43, 
45, 47, 48, 49, 50, 51. 

gb-init_rand: 4. 

gb-rand: 3. 

gb_unif_rand: 28, 43. 

a: 2, 8, 14, 43. 

uw: 36, 37, 38, 39. 

wi: 36, 37, 38, 39. 
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imems: 2, 3. 

include_option: 31, 36. 

inf_size: 31, 43. 

tpropcount: 8, 15, 18, 20, 21, 22, 23. 

isspace: 15, 16, 19. 

istage: 2, 16. 

item: 8, 9, 13, 14, 22, 23, 24, 25, 27, 28, 36, 37, 


itemlength: 
atm: 


ue 
k: 
l: 


last_itm: 

last_node: 
level: 
levelstage: 


ll: 


Iname: 
loc: 


Ir: 
m: 


38, 39, 40, 42, 43, 45 
2, 9, 13, 14, 22, 26, 27, 40, 42. 
8, 12, 14, 19, 20, 21, 24, 30, 36, 38, 39. 


10, 14, 49, 50. 
9, 15, 18, 20, 22. 

2, 9, 12, 14, 19, 20, 21, 24, 26. 
30, 31, 32, 35, 37, 38, 39, 43, 47, 49, 50. 
30, 31, 35, 44, 49, 50. 
50. 
8, 11, 15, 18, 20, 23. 

8, 12, 14, 19, 20, 24, 36, 37, 38, 39. 
10, 11, 15, 18, 19, 20. 
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main: 2. 


max_cols : 
max_level: 
max_nodes : 
maz_stage: 
maxcount: 
maxdeg : 
maxextra: 
maal: 
maxs: 
mazxsaveptr : 
maxwt : 
mems: 
mod: 
namebuf : 


nb: 
nd: 


nn: 


nnp: 
node: 


1, 9, 15, 16, 22 
1,730;32. 35, 
16.9 19,.20; 
1, 30, 34. 
3, 4, 44. 
9. 3, Ad, 
8, 9. 
2, 3, 35, 37, 38, 39, 43, 49. 
3, 34, 51. 
2, 3, 45. 
3, 40. 
1, 2, 3, 28, 33, 40, 43, 49, 50 
1, 44. 
10, 11, 15, 16, 17, 18, 19, 20. 
23. 
8, 9, 12, 14, 19, 20, 21, 24, 36, 37, 38, 39. 
36, 38. 
36, 37, 38, 39. 
2, 8, 9. 


node_struct: 8. 


nodes: 


O: 


O: 


2, 3, 31, 40. 
ft: 
i 


oactive: 9. 


ok: 


oo: 


ooo: 


14. 
1, 15, 21, 22, 23, 28, 31, 35, 37, 38, 39 
40, 45, 46. 

1, 45. 
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opt: 36, 37. 

options: 3, 19, 26. 

optp: 36, 37, 38. 

osecond: 9, 22, 23, 26. 

p: 2, 12, 13, 36, 38, 39, 50. 

panic: 15, 16, 17, 18, 19, 20. 

pos: 8, 18, 14, 20, 21, 23, 28, 36, 37, 38, 39, 41. 
pp: 2, 17, 19, 20, 36, 37, 38, 39. 


primextra: 8, 13, 22. 

printitemname: 11, 12, 13, 14, 25, 37, 38, 39, 
40, 42, 43, 48. 

printitems: 14. 

printitm: 13, 14. 

print_option: 12, 36, 44, 49. 

print_progress: 33, 50. 


print_savestack: 48. 

print_state: 33, 49. 

print.weights: 2, 42. 

printf: 4A. 

profile: 30, 31, 51. 

prow: 12, 13. 

purgeit: 31. 

q: 2, 12, 14. 

qq: 14. 

r: 2, 10, 14. 

random_seed : A, 

randomizing: 3, 4, 43. 

rname: 8, 11, 15, 18, 20, 23. 

s: 2, 36, 49. 

sanity: 14, 31. 

sanity.checking: 2, 14, 31. 

saved: 31, 32, 41, 45, 46. 

saveptr: 31, 32, 41, 45, 46. 

savesize: 1, 32, 45. 

savestack: 1, 3, 32, 41, 45, 46, 48. 

score: 2, 31, 43, 47. 

second: 9, 13, 15, 16, 19, 20, 22, 23, 27, 36, 37, 
38, 39, 40, 42, 43, 45, 46. 

secondextra: 8, 22. 

set: 8,9, 10, 13, 14, 15, 23, 24, 31, 37, 38, 39. 

setlength: 2, 9, 13, 22. 

shape_file: 3, 4, 5, 43. 

shape.name: 3, 4. 

show_basics: 1, 2, 3. 

show_choices: 1, 3, 25, 31, 36, 47. 

show_choices_gap: 3, 4, 37, 38, 39, 43. 

show_choices.max: 3, 4, 37, 38, 39, 43, 47. 

show_details: 1, 3, 37, 38, 39, 43. 


3, 
2, 


show-_final.weights: 1, 2. 
show-_full_state: 1, 33. 
show_levels.maz: 3, 4, 49, 50. 


show_max_deg: 1, 2. 
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show-profile: 1, 2, 31. 

show-record_weights: 1, 40. 

show_tots: 1, 2. 

show_warnings: 1, 3, 19. 

show_weight_bumps: 1, 40. 

showpos: 12. 

size: 8, 12, 13, 14, 20, 21, 22, 23, 27, 37, 38, 
39, 43, 45, 46. 

slack: 8, 13, 15, 23, 37, 38, 39, 48. 

spacing: 3, 4, 44. 

spr: 8, 19. 

ss: 36, 37, 38, 39. 

sscanf: 4. 

stage: 30, 31, 34, 35, 36, 41, 44, 45, 46. 

stagelevel: 30, 35, 49, 50. 

start: 48. 


startname: 16 


stderr: 1, 2, 3, 4, 6, 12, 13, 14, 15, 19, 25, 26, 


27, 31, 33, 34, 35, 36, 37, 38, 39, 40, 42, 43, 
45, 47, 48, 49, 50, 51. 


stdin: 15, 19. 

stdout: 44. 

stop: 48. 

str: 10, 11, 16, 17, 19. 

stream: 11, 12. 

stringbuf: 10. 

strlen: 15, 19. 

subroutine_overhead: 1, 36. 

t: 2, 40. 

tetrabyte: 8, 9. 

thresh: 3, 4, 33. 

timeout: 3, 4, 33. 

tmems: 3, 40, 48. 

tough_itm: 32, 36, 37, 38, 40. 

tryagain: 31. 

twoints: 2, 10, 32. 

uint: 2. 

ullng: 2, 3, 30. 

updates: 2, 3, 37, 38, 39. 

ubose: 1, 2, 3, 4, 19, 25, 31, 33, 36, 37, 38, 
39, 40, 43, 47. 

wmin: AO. 

wt: 8, 13, 23, 40, 42. 

w0: 3, 23, 40, 42. 

a: 12, 14. 
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(Adjust nd 24) Used in section 19. 

(Check for duplicate item name 18) Used in section 16. 

(Close the files 5) Used in section 2. 

(Convert the prefix to an integer, q 17) Used in section 16. 

(Cover or commit item ii, decrease bound of item % if primary, potentially deactivate it, or return 0 37) 
Used in section 36. 

(Create a node for the item named in buf [p] 20) Used in section 19. 

(Do a forced move 47) Used in sections 31 and 41. 

(Do special things if enough mems have accumulated 33) Used in section 31. 

(Expand set 23) Used in section 19. 

(Global variables 3, 9, 30,32) Used in section 2. 

(Increase the weight of tough_itm 40) 

(Increase level 35) Used in section 31. 

(Increase stage 34) Used in section 31. 

(Initialize item 22) Used in section 19. 

(Input the item names 15) Used in section 2. 

(Input the options 19) Used in section 2. 

(Maybe do a forced move 41) Used in section 31. 

(Print the profile 51) Used in section 2. 

(Process the command line 4) Used in section 2. 

(Randomize the item list 28) Used in section 2. 

(Remove the option cur_choice 39) Used in section 31. 

(Remove last_node from its item list 21) Used in section 19. 

(Remove optp from its other sets, or return 0 38) Used in section 37. 

(Report an uncoverable item 25) Used in section 2. 

(Report the item totals 27) Used in section 2. 

(Report the successful completion of the input phase 26) Used in section 2. 

(Restore the currently active items and their sizes and bounds 46) Used in section 31. 

(Save the currently active items and their sizes and bounds 45) Used in section 31. 

(Scan an item name, possibly prefixed by bounds 16) Used in section 15. 

(Set best_itm to the best item for branching, and let score be its branching degree 43) Used in section 31. 

(Solve the problem 31) Used in section 2. 

(Subroutines 6, 11, 12, 13, 14, 36, 42, 48, 49, 50) Used in section 2. 

(Type definitions 8,10) Used in section 2. 

( Visit a solution and goto backup 44) Used in section 31. 
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